


An Honest Cross-Validation Estimator for Prediction Performance

Pan, Tianyu, Yu, Vincent Z., Devanarayan, Viswanath, Tian, Lu

arXiv.org Machine Learning

Cross-validation is a standard tool for obtaining an honest assessment of the performance of a prediction model. The commonly used version repeatedly splits the data, trains the prediction model on the training set, evaluates the model performance on the test set, and averages the model performance across different data splits. A well-known criticism is that such a cross-validation procedure does not directly estimate the performance of the particular model recommended for future use. In this paper, we propose a new method to estimate the performance of a model trained on a specific (random) training set. A naive estimator can be obtained by applying the model to a disjoint testing set. Surprisingly, cross-validation estimators computed from other random splits can be used to improve this naive estimator within a random-effects model framework. We develop two estimators -- a hierarchical Bayesian estimator and an empirical Bayes estimator -- that perform similarly to or better than both the conventional cross-validation estimator and the naive single-split estimator. Simulations and a real-data example demonstrate the superior performance of the proposed method.
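The random-effects idea above can be illustrated with a simple shrinkage sketch: the naive single-split estimate is pulled toward the mean of estimates from other random splits, with a weight driven by the between-split variability. This is an illustrative James-Stein-style sketch, not the authors' hierarchical Bayesian or empirical Bayes estimator; the equal within- and between-split variance assumption is mine.

```python
import numpy as np

def shrinkage_estimate(naive, other_splits):
    """Shrink a single-split performance estimate toward the mean of
    estimates from other random splits (random-effects model idea).

    naive: performance estimate for the model trained on the split of interest
    other_splits: performance estimates from other random data splits
    """
    other = np.asarray(other_splits, dtype=float)
    mu = other.mean()            # grand mean across splits
    tau2 = other.var(ddof=1)     # between-split variability
    sigma2 = tau2                # assume within-split noise of the same scale (illustrative)
    w = tau2 / (tau2 + sigma2)   # shrinkage weight in [0, 1]
    return w * naive + (1 - w) * mu
```

With equal variance components the weight is 0.5, so the estimate is the midpoint between the naive value and the mean over the other splits.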



BenchMake: Turn any scientific data set into a reproducible benchmark

Barnard, Amanda S

arXiv.org Artificial Intelligence

Benchmark data sets are curated collections that enable consistent, reproducible, and objective evaluation of algorithms and models [1, 2]. They are essential for comparing algorithm performance fairly, particularly in machine learning (ML) and artificial intelligence (AI), where the suitability of algorithms can vary widely based on data structure, dimensionality, and distribution [3, 4]. For instance, algorithms that perform exceptionally on structured, tabular data may not generalise well to unstructured image or textual data [5]. Established benchmarks such as ImageNet [2], CIFAR data sets [6], and OpenML benchmarks for structured data [7] have driven innovation by providing clear metrics for progress, fostering reproducibility and trust within the research community [8]. However, in computational sciences, standardised benchmarks remain rare and challenging to establish due to the intrinsic complexity, heterogeneity, and domain specificity of scientific data [9]. Scientific data sets can be represented in a variety of ways (tables, images, text, graphs, signals), often require extensive pre-processing and specialised evaluation metrics, and are subject to measurement noise, natural variability, and data imbalance [10].


TRIDIS: A Comprehensive Medieval and Early Modern Corpus for HTR and NER

Aguilar, Sergio Torres

arXiv.org Artificial Intelligence

This paper introduces TRIDIS (Tria Digita Scribunt), an open-source corpus of medieval and early modern manuscripts. TRIDIS aggregates multiple legacy collections (all published under open licenses) and incorporates large metadata descriptions. While prior publications referenced some portions of this corpus, here we provide a unified overview with a stronger focus on its constitution. We describe (i) the narrative, chronological, and editorial background of each major sub-corpus, (ii) its semi-diplomatic transcription rules (expansion, normalization, punctuation), (iii) a strategy for challenging out-of-domain test splits driven by outlier detection in a joint embedding space, and (iv) preliminary baseline experiments using TrOCR and MiniCPM-Llama3-V 2.5 comparing random and outlier-based test partitions. Overall, TRIDIS is designed to stimulate joint robust Handwritten Text Recognition (HTR) and Named Entity Recognition (NER) research across medieval and early modern textual heritage.
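The outlier-driven split in (iii) can be sketched as follows: embed each document, then hold out the samples farthest from the embedding centroid as an out-of-domain test set. This is a minimal sketch assuming centroid distance as the outlier score; the paper's actual outlier-detection method in the joint embedding space may differ.

```python
import numpy as np

def outlier_test_split(embeddings, test_frac=0.2):
    """Pick the most atypical samples (farthest from the centroid of a
    joint embedding space) as an out-of-domain test set.
    Returns (train_idx, test_idx) as index arrays."""
    X = np.asarray(embeddings, dtype=float)
    centroid = X.mean(axis=0)
    dists = np.linalg.norm(X - centroid, axis=1)  # distance of each sample to the centroid
    n_test = max(1, int(round(test_frac * len(X))))
    order = np.argsort(dists)                     # ascending distance
    test_idx = order[-n_test:]                    # farthest samples become the test set
    train_idx = order[:-n_test]
    return train_idx, test_idx
```

Compared with a random split, this deliberately makes the test set harder, which is the point of the random-vs-outlier comparison in (iv).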


Understanding the Limits of Deep Tabular Methods with Temporal Shift

Cai, Hao-Run, Ye, Han-Jia

arXiv.org Artificial Intelligence

Deep tabular models have demonstrated remarkable success on i.i.d. data, excelling in a variety of structured data tasks. However, their performance often deteriorates under temporal distribution shifts, where trends and periodic patterns are present in the evolving data distribution over time. In this paper, we explore the underlying reasons for this failure in capturing temporal dependencies. We begin by investigating the training protocol, revealing a key issue in how model selection is performed. While existing approaches use temporal ordering to split the validation set, we show that even a random split can significantly improve model performance. By minimizing the time lag between training data and test time, while reducing the bias in validation, our proposed training protocol significantly improves generalization across various methods. Furthermore, we analyze how temporal data affects deep tabular representations, uncovering that these models often fail to capture crucial periodic and trend information. To address this gap, we introduce a plug-and-play temporal embedding method based on Fourier series expansion to learn and incorporate temporal patterns, offering an adaptive approach to handle temporal shifts. Our experiments demonstrate that this temporal embedding, combined with the improved training protocol, provides a more effective and robust framework for learning from temporal tabular data.
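A Fourier-series temporal embedding of the kind described above can be sketched as a fixed feature map: each timestamp is expanded into sine/cosine pairs of increasing frequency, which a tabular model can then consume as extra columns. This is a generic sketch; the paper's plug-and-play embedding learns its parameters, and the `period` here is an assumed dominant cycle length.

```python
import numpy as np

def fourier_time_features(t, period, n_terms=3):
    """Expand timestamps into sin/cos features of increasing frequency
    so a tabular model can pick up periodic structure.

    t: array of timestamps; period: assumed dominant cycle length.
    Returns an array of shape (len(t), 2 * n_terms)."""
    t = np.asarray(t, dtype=float)
    feats = []
    for k in range(1, n_terms + 1):
        angle = 2 * np.pi * k * t / period
        feats.append(np.sin(angle))   # odd harmonic component
        feats.append(np.cos(angle))   # even harmonic component
    return np.stack(feats, axis=-1)
```

The resulting columns would typically be concatenated with the original tabular features before training.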


Excited-state nonadiabatic dynamics in explicit solvent using machine learned interatomic potentials

Tiefenbacher, Maximilian X., Bachmair, Brigitta, Chen, Cheng Giuseppe, Westermayr, Julia, Marquetand, Philipp, Dietschreit, Johannes C. B., González, Leticia

arXiv.org Artificial Intelligence

Excited-state nonadiabatic simulations with quantum mechanics/molecular mechanics (QM/MM) are essential to understand photoinduced processes in explicit environments. However, the high computational cost of the underlying quantum chemical calculations limits their application in combination with trajectory surface hopping methods. Here, we use FieldSchNet, a machine-learned interatomic potential capable of incorporating electric field effects into the electronic states, to replace traditional QM/MM electrostatic embedding with its ML/MM counterpart for nonadiabatic excited-state trajectories. The developed method is applied to furan in water, including five coupled singlet states. Our results demonstrate that with sufficiently curated training data, the ML/MM model reproduces the electronic kinetics and structural rearrangements of QM/MM surface hopping reference simulations. Furthermore, we identify performance metrics that provide robust and interpretable validation of model accuracy.


Enhancing Drug-Target Interaction Prediction through Transfer Learning from Activity Cliff Prediction Tasks

Ibragimova, Regina, Iliadis, Dimitrios, Waegeman, Willem

arXiv.org Artificial Intelligence

Recently, machine learning (ML) has gained popularity in the early stages of drug discovery. This trend is unsurprising given the increasing volume of relevant experimental data and the continuous improvement of ML algorithms. However, conventional models, which rely on the principle of molecular similarity, often fail to capture the complexities of chemical interactions, particularly those involving activity cliffs (ACs), compounds that are structurally similar but exhibit markedly different activities. In this work, we address two distinct yet related tasks: (1) activity cliff (AC) prediction and (2) drug-target interaction (DTI) prediction. Leveraging insights gained from the AC prediction task, we aim to improve the performance of DTI prediction through transfer learning. A universal model was developed for AC prediction, capable of identifying activity cliffs across diverse targets. Insights from this model were then incorporated into DTI prediction, enabling better handling of challenging cases involving ACs while maintaining similar overall performance. This approach establishes a strong foundation for integrating AC awareness into predictive models for drug discovery.

Scientific Contribution: This study presents a novel approach that applies transfer learning from AC prediction to enhance DTI prediction, addressing limitations of traditional similarity-based models. By introducing AC-awareness, we improve DTI model performance in structurally complex regions, demonstrating the benefits of integrating compound-specific and protein-contextual information. Unlike previous studies, which treat AC and DTI predictions as separate problems, this work establishes a unified framework to address both data scarcity and prediction challenges in drug discovery.
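The transfer-learning step described above, reusing representations learned on the AC task to warm-start the DTI model, can be sketched at the parameter level. The plain-dict parameter format, the `encoder.` prefix, and the `transfer_encoder` helper are all assumptions for illustration; the paper's actual architectures and transfer mechanism are not specified here.

```python
def transfer_encoder(ac_params, dti_params, prefix="encoder."):
    """Copy encoder parameters learned on the AC task into a fresh DTI
    model, leaving the DTI-specific head untouched.

    Parameters are plain {name: value} dicts (assumption); any name
    starting with `prefix` is treated as part of the shared encoder."""
    transferred = dict(dti_params)        # start from the fresh DTI model
    for name, value in ac_params.items():
        if name.startswith(prefix):       # overwrite only shared encoder weights
            transferred[name] = value
    return transferred
```

In a real pipeline the copied encoder would usually be either frozen or fine-tuned at a reduced learning rate on the DTI objective.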


TabReD: A Benchmark of Tabular Machine Learning in-the-Wild

Rubachev, Ivan, Kartashev, Nikolay, Gorishniy, Yury, Babenko, Artem

arXiv.org Artificial Intelligence

Benchmarks that closely reflect downstream application scenarios are essential for the streamlined adoption of new research in tabular machine learning (ML). In this work, we examine existing tabular benchmarks and find two common characteristics of industry-grade tabular data that are underrepresented in the datasets available to the academic community. First, tabular data often changes over time in real-world deployment scenarios. This impacts model performance and requires time-based train and test splits for correct model evaluation. Yet, existing academic tabular datasets often lack timestamp metadata to enable such evaluation. Second, a considerable portion of datasets in production settings stem from extensive data acquisition and feature engineering pipelines. For each specific dataset, this can have a different impact on the absolute and relative number of predictive, uninformative, and correlated features, which in turn can affect model selection. To fill the aforementioned gaps in academic benchmarks, we introduce TabReD -- a collection of eight industry-grade tabular datasets covering a wide range of domains from finance to food delivery services. We assess a large number of tabular ML models in the feature-rich, temporally-evolving data setting facilitated by TabReD. We demonstrate that evaluation on time-based data splits leads to a different ranking of methods compared to evaluation on the random splits more common in academic benchmarks. Furthermore, on the TabReD datasets, MLP-like architectures and GBDT models show the best results, while more sophisticated DL models are yet to prove their effectiveness.
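The time-based split that the abstract contrasts with random splitting can be sketched directly: sort by timestamp and hold out the latest fraction, so the model is trained strictly on the past and evaluated on the future. A minimal sketch, assuming per-row timestamps are available:

```python
import numpy as np

def time_based_split(timestamps, test_frac=0.2):
    """Split row indices so the test set is strictly later in time than
    the training set, mimicking deployment (train on past, predict future).
    Returns (train_idx, test_idx) as index arrays."""
    ts = np.asarray(timestamps)
    order = np.argsort(ts, kind="stable")   # chronological order of rows
    n_test = max(1, int(round(test_frac * len(ts))))
    train_idx = order[:-n_test]             # earliest rows
    test_idx = order[-n_test:]              # latest rows held out for evaluation
    return train_idx, test_idx
```

Swapping this in for a uniform random split is exactly the evaluation change that TabReD reports can reorder method rankings.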


EUROPA: A Legal Multilingual Keyphrase Generation Dataset

Salaün, Olivier, Piedboeuf, Frédéric, Berre, Guillaume Le, Hermelo, David Alfonso, Langlais, Philippe

arXiv.org Artificial Intelligence

Keyphrase generation has primarily been explored within the context of academic research articles, with a particular focus on scientific domains and the English language. In this work, we present EUROPA, a dataset for multilingual keyphrase generation in the legal domain. It is derived from legal judgments from the Court of Justice of the European Union (EU), and contains instances in all 24 EU official languages. We run multilingual models on our corpus and analyze the results, showing room for improvement on a domain-specific multilingual corpus such as the one we present.